Dataset statistics
| Number of variables | 13 |
|---|---|
| Number of observations | 3961 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 402.4 KiB |
| Average record size in memory | 104.0 B |
Variable types
| NUM | 13 |
|---|
df_index has unique values | Unique |
Reproduction
| Analysis started | 2020-11-11 20:10:16.389716 |
|---|---|
| Analysis finished | 2020-11-11 20:11:15.652659 |
| Duration | 59.26 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
| Distinct | 3961 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2414.467054 |
|---|---|
| Minimum | 0 |
| Maximum | 4897 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 30.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 237 |
| Q1 | 1170 |
| median | 2385 |
| Q3 | 3641 |
| 95-th percentile | 4661 |
| Maximum | 4897 |
| Range | 4897 |
| Interquartile range (IQR) | 2471 |
Descriptive statistics
| Standard deviation | 1426.02502 |
|---|---|
| Coefficient of variation (CV) | 0.5906168892 |
| Kurtosis | -1.207715853 |
| Mean | 2414.467054 |
| Median Absolute Deviation (MAD) | 1235 |
| Skewness | 0.04938645192 |
| Sum | 9563704 |
| Variance | 2033547.359 |
| Monotocity | Strictly increasing |
| Value | Count | Frequency (%) | |
| 2047 | 1 | < 0.1% | |
| 2600 | 1 | < 0.1% | |
| 2596 | 1 | < 0.1% | |
| 4643 | 1 | < 0.1% | |
| 545 | 1 | < 0.1% | |
| 2592 | 1 | < 0.1% | |
| 4639 | 1 | < 0.1% | |
| 541 | 1 | < 0.1% | |
| 4635 | 1 | < 0.1% | |
| 4631 | 1 | < 0.1% | |
| Other values (3951) | 3951 | 99.7% |
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 1 | 1 | < 0.1% | |
| 2 | 1 | < 0.1% | |
| 3 | 1 | < 0.1% | |
| 6 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 4897 | 1 | < 0.1% | |
| 4896 | 1 | < 0.1% | |
| 4895 | 1 | < 0.1% | |
| 4894 | 1 | < 0.1% | |
| 4893 | 1 | < 0.1% |
fa
Real number (ℝ≥0)
| Distinct | 68 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.839346125 |
|---|---|
| Minimum | 3.8 |
| Maximum | 14.2 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 30.9 KiB |
Quantile statistics
| Minimum | 3.8 |
|---|---|
| 5-th percentile | 5.6 |
| Q1 | 6.3 |
| median | 6.8 |
| Q3 | 7.3 |
| 95-th percentile | 8.3 |
| Maximum | 14.2 |
| Range | 10.4 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.8668597405 |
|---|---|
| Coefficient of variation (CV) | 0.1267459966 |
| Kurtosis | 2.253047398 |
| Mean | 6.839346125 |
| Median Absolute Deviation (MAD) | 0.5 |
| Skewness | 0.6961002189 |
| Sum | 27090.65 |
| Variance | 0.7514458097 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 6.8 | 241 | 6.1% | |
| 6.6 | 238 | 6.0% | |
| 6.4 | 224 | 5.7% | |
| 6.9 | 191 | 4.8% | |
| 6.7 | 190 | 4.8% | |
| 6.5 | 182 | 4.6% | |
| 7 | 179 | 4.5% | |
| 6.2 | 159 | 4.0% | |
| 6.3 | 158 | 4.0% | |
| 7.1 | 154 | 3.9% | |
| Other values (58) | 2045 | 51.6% |
| Value | Count | Frequency (%) | |
| 3.8 | 1 | < 0.1% | |
| 3.9 | 1 | < 0.1% | |
| 4.2 | 2 | 0.1% | |
| 4.4 | 3 | 0.1% | |
| 4.5 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 14.2 | 1 | < 0.1% | |
| 11.8 | 1 | < 0.1% | |
| 10.7 | 1 | < 0.1% | |
| 10.3 | 2 | 0.1% | |
| 10.2 | 1 | < 0.1% |
va
Real number (ℝ≥0)
| Distinct | 125 |
|---|---|
| Distinct (%) | 3.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.280537743 |
|---|---|
| Minimum | 0.08 |
| Maximum | 1.1 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 30.9 KiB |
Quantile statistics
| Minimum | 0.08 |
|---|---|
| 5-th percentile | 0.15 |
| Q1 | 0.21 |
| median | 0.26 |
| Q3 | 0.33 |
| 95-th percentile | 0.46 |
| Maximum | 1.1 |
| Range | 1.02 |
| Interquartile range (IQR) | 0.12 |
Descriptive statistics
| Standard deviation | 0.103437087 |
|---|---|
| Coefficient of variation (CV) | 0.3687100562 |
| Kurtosis | 5.327754 |
| Mean | 0.280537743 |
| Median Absolute Deviation (MAD) | 0.06 |
| Skewness | 1.641080979 |
| Sum | 1111.21 |
| Variance | 0.01069923096 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0.28 | 213 | 5.4% | |
| 0.24 | 208 | 5.3% | |
| 0.26 | 207 | 5.2% | |
| 0.25 | 179 | 4.5% | |
| 0.22 | 178 | 4.5% | |
| 0.2 | 175 | 4.4% | |
| 0.27 | 175 | 4.4% | |
| 0.23 | 173 | 4.4% | |
| 0.21 | 158 | 4.0% | |
| 0.3 | 154 | 3.9% | |
| Other values (115) | 2141 | 54.1% |
| Value | Count | Frequency (%) | |
| 0.08 | 2 | 0.1% | |
| 0.085 | 1 | < 0.1% | |
| 0.09 | 1 | < 0.1% | |
| 0.1 | 6 | 0.2% | |
| 0.105 | 4 | 0.1% |
| Value | Count | Frequency (%) | |
| 1.1 | 1 | < 0.1% | |
| 1.005 | 1 | < 0.1% | |
| 0.965 | 1 | < 0.1% | |
| 0.93 | 1 | < 0.1% | |
| 0.91 | 1 | < 0.1% |
ca
Real number (ℝ≥0)
| Distinct | 87 |
|---|---|
| Distinct (%) | 2.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.3343322393 |
|---|---|
| Minimum | 0 |
| Maximum | 1.66 |
| Zeros | 18 |
| Zeros (%) | 0.5% |
| Memory size | 30.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.17 |
| Q1 | 0.27 |
| median | 0.32 |
| Q3 | 0.39 |
| 95-th percentile | 0.53 |
| Maximum | 1.66 |
| Range | 1.66 |
| Interquartile range (IQR) | 0.12 |
Descriptive statistics
| Standard deviation | 0.1224460908 |
|---|---|
| Coefficient of variation (CV) | 0.3662407521 |
| Kurtosis | 6.84480817 |
| Mean | 0.3343322393 |
| Median Absolute Deviation (MAD) | 0.06 |
| Skewness | 1.310601017 |
| Sum | 1324.29 |
| Variance | 0.01499304514 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0.3 | 239 | 6.0% | |
| 0.28 | 220 | 5.6% | |
| 0.32 | 214 | 5.4% | |
| 0.34 | 181 | 4.6% | |
| 0.29 | 179 | 4.5% | |
| 0.26 | 173 | 4.4% | |
| 0.49 | 173 | 4.4% | |
| 0.27 | 164 | 4.1% | |
| 0.31 | 162 | 4.1% | |
| 0.33 | 155 | 3.9% | |
| Other values (77) | 2101 | 53.0% |
| Value | Count | Frequency (%) | |
| 0 | 18 | 0.5% | |
| 0.01 | 6 | 0.2% | |
| 0.02 | 6 | 0.2% | |
| 0.03 | 2 | 0.1% | |
| 0.04 | 10 | 0.3% |
| Value | Count | Frequency (%) | |
| 1.66 | 1 | < 0.1% | |
| 1.23 | 1 | < 0.1% | |
| 1 | 5 | 0.1% | |
| 0.99 | 1 | < 0.1% | |
| 0.91 | 1 | < 0.1% |
rs
Real number (ℝ≥0)
| Distinct | 310 |
|---|---|
| Distinct (%) | 7.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.91481949 |
|---|---|
| Minimum | 0.6 |
| Maximum | 65.8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 30.9 KiB |
Quantile statistics
| Minimum | 0.6 |
|---|---|
| 5-th percentile | 1.1 |
| Q1 | 1.6 |
| median | 4.7 |
| Q3 | 8.9 |
| 95-th percentile | 15.2 |
| Maximum | 65.8 |
| Range | 65.2 |
| Interquartile range (IQR) | 7.3 |
Descriptive statistics
| Standard deviation | 4.861646308 |
|---|---|
| Coefficient of variation (CV) | 0.8219433097 |
| Kurtosis | 5.681512166 |
| Mean | 5.91481949 |
| Median Absolute Deviation (MAD) | 3.2 |
| Skewness | 1.333639018 |
| Sum | 23428.6 |
| Variance | 23.63560482 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 1.4 | 165 | 4.2% | |
| 1.2 | 165 | 4.2% | |
| 1.6 | 144 | 3.6% | |
| 1.3 | 134 | 3.4% | |
| 1.1 | 126 | 3.2% | |
| 1.5 | 125 | 3.2% | |
| 1.7 | 87 | 2.2% | |
| 1.8 | 85 | 2.1% | |
| 1 | 77 | 1.9% | |
| 2 | 67 | 1.7% | |
| Other values (300) | 2786 | 70.3% |
| Value | Count | Frequency (%) | |
| 0.6 | 1 | < 0.1% | |
| 0.7 | 7 | 0.2% | |
| 0.8 | 25 | 0.6% | |
| 0.9 | 35 | 0.9% | |
| 0.95 | 3 | 0.1% |
| Value | Count | Frequency (%) | |
| 65.8 | 1 | < 0.1% | |
| 31.6 | 1 | < 0.1% | |
| 26.05 | 1 | < 0.1% | |
| 23.5 | 1 | < 0.1% | |
| 22.6 | 1 | < 0.1% |
chlorides
Real number (ℝ≥0)
| Distinct | 160 |
|---|---|
| Distinct (%) | 4.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.04590507448 |
|---|---|
| Minimum | 0.009 |
| Maximum | 0.346 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 30.9 KiB |
Quantile statistics
| Minimum | 0.009 |
|---|---|
| 5-th percentile | 0.027 |
| Q1 | 0.035 |
| median | 0.042 |
| Q3 | 0.05 |
| 95-th percentile | 0.069 |
| Maximum | 0.346 |
| Range | 0.337 |
| Interquartile range (IQR) | 0.015 |
Descriptive statistics
| Standard deviation | 0.0231027148 |
|---|---|
| Coefficient of variation (CV) | 0.5032714807 |
| Kurtosis | 35.53028798 |
| Mean | 0.04590507448 |
| Median Absolute Deviation (MAD) | 0.007 |
| Skewness | 4.969076318 |
| Sum | 181.83 |
| Variance | 0.0005337354313 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0.036 | 165 | 4.2% | |
| 0.042 | 155 | 3.9% | |
| 0.044 | 155 | 3.9% | |
| 0.046 | 152 | 3.8% | |
| 0.04 | 152 | 3.8% | |
| 0.047 | 145 | 3.7% | |
| 0.038 | 140 | 3.5% | |
| 0.034 | 137 | 3.5% | |
| 0.037 | 136 | 3.4% | |
| 0.048 | 135 | 3.4% | |
| Other values (150) | 2489 | 62.8% |
| Value | Count | Frequency (%) | |
| 0.009 | 1 | < 0.1% | |
| 0.012 | 1 | < 0.1% | |
| 0.013 | 1 | < 0.1% | |
| 0.014 | 4 | 0.1% | |
| 0.015 | 3 | 0.1% |
| Value | Count | Frequency (%) | |
| 0.346 | 1 | < 0.1% | |
| 0.301 | 1 | < 0.1% | |
| 0.29 | 1 | < 0.1% | |
| 0.271 | 1 | < 0.1% | |
| 0.255 | 1 | < 0.1% |
fsd
Real number (ℝ≥0)
| Distinct | 132 |
|---|---|
| Distinct (%) | 3.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 34.8891694 |
|---|---|
| Minimum | 2 |
| Maximum | 289 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 30.9 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 11 |
| Q1 | 23 |
| median | 33 |
| Q3 | 45 |
| 95-th percentile | 63 |
| Maximum | 289 |
| Range | 287 |
| Interquartile range (IQR) | 22 |
Descriptive statistics
| Standard deviation | 17.21002061 |
|---|---|
| Coefficient of variation (CV) | 0.4932768795 |
| Kurtosis | 13.43402487 |
| Mean | 34.8891694 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | 1.56668022 |
| Sum | 138196 |
| Variance | 296.1848094 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 29 | 125 | 3.2% | |
| 31 | 110 | 2.8% | |
| 34 | 107 | 2.7% | |
| 26 | 105 | 2.7% | |
| 36 | 102 | 2.6% | |
| 24 | 101 | 2.5% | |
| 35 | 97 | 2.4% | |
| 28 | 95 | 2.4% | |
| 23 | 93 | 2.3% | |
| 25 | 92 | 2.3% | |
| Other values (122) | 2934 | 74.1% |
| Value | Count | Frequency (%) | |
| 2 | 1 | < 0.1% | |
| 3 | 9 | 0.2% | |
| 4 | 9 | 0.2% | |
| 5 | 23 | 0.6% | |
| 6 | 29 | 0.7% |
| Value | Count | Frequency (%) | |
| 289 | 1 | < 0.1% | |
| 146.5 | 1 | < 0.1% | |
| 138.5 | 1 | < 0.1% | |
| 131 | 1 | < 0.1% | |
| 128 | 1 | < 0.1% |
tsd
Real number (ℝ≥0)
| Distinct | 251 |
|---|---|
| Distinct (%) | 6.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 137.1935117 |
|---|---|
| Minimum | 9 |
| Maximum | 440 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 30.9 KiB |
Quantile statistics
| Minimum | 9 |
|---|---|
| 5-th percentile | 73 |
| Q1 | 106 |
| median | 133 |
| Q3 | 166 |
| 95-th percentile | 212 |
| Maximum | 440 |
| Range | 431 |
| Interquartile range (IQR) | 60 |
Descriptive statistics
| Standard deviation | 43.12906524 |
|---|---|
| Coefficient of variation (CV) | 0.314366654 |
| Kurtosis | 0.7352578602 |
| Mean | 137.1935117 |
| Median Absolute Deviation (MAD) | 29 |
| Skewness | 0.4567996771 |
| Sum | 543423.5 |
| Variance | 1860.116268 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 111 | 51 | 1.3% | |
| 114 | 47 | 1.2% | |
| 113 | 46 | 1.2% | |
| 122 | 45 | 1.1% | |
| 128 | 45 | 1.1% | |
| 117 | 44 | 1.1% | |
| 150 | 43 | 1.1% | |
| 126 | 42 | 1.1% | |
| 98 | 41 | 1.0% | |
| 118 | 41 | 1.0% | |
| Other values (241) | 3516 | 88.8% |
| Value | Count | Frequency (%) | |
| 9 | 1 | < 0.1% | |
| 10 | 1 | < 0.1% | |
| 18 | 1 | < 0.1% | |
| 19 | 1 | < 0.1% | |
| 21 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 440 | 1 | < 0.1% | |
| 366.5 | 1 | < 0.1% | |
| 344 | 1 | < 0.1% | |
| 313 | 1 | < 0.1% | |
| 307.5 | 1 | < 0.1% |
density
Real number (ℝ≥0)
| Distinct | 890 |
|---|---|
| Distinct (%) | 22.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.9937895304 |
|---|---|
| Minimum | 0.98711 |
| Maximum | 1.03898 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 30.9 KiB |
Quantile statistics
| Minimum | 0.98711 |
|---|---|
| 5-th percentile | 0.98961 |
| Q1 | 0.99162 |
| median | 0.9935 |
| Q3 | 0.99571 |
| 95-th percentile | 0.9986 |
| Maximum | 1.03898 |
| Range | 0.05187 |
| Interquartile range (IQR) | 0.00409 |
Descriptive statistics
| Standard deviation | 0.002904595778 |
|---|---|
| Coefficient of variation (CV) | 0.002922747412 |
| Kurtosis | 14.18489211 |
| Mean | 0.9937895304 |
| Median Absolute Deviation (MAD) | 0.00206 |
| Skewness | 1.273317861 |
| Sum | 3936.40033 |
| Variance | 8.436676636e-06 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0.992 | 60 | 1.5% | |
| 0.9928 | 52 | 1.3% | |
| 0.9932 | 49 | 1.2% | |
| 0.9934 | 46 | 1.2% | |
| 0.993 | 46 | 1.2% | |
| 0.9938 | 43 | 1.1% | |
| 0.9944 | 41 | 1.0% | |
| 0.9927 | 40 | 1.0% | |
| 0.9924 | 39 | 1.0% | |
| 0.9954 | 37 | 0.9% | |
| Other values (880) | 3508 | 88.6% |
| Value | Count | Frequency (%) | |
| 0.98711 | 1 | < 0.1% | |
| 0.98713 | 1 | < 0.1% | |
| 0.98722 | 1 | < 0.1% | |
| 0.9874 | 1 | < 0.1% | |
| 0.98742 | 2 | 0.1% |
| Value | Count | Frequency (%) | |
| 1.03898 | 1 | < 0.1% | |
| 1.0103 | 1 | < 0.1% | |
| 1.00295 | 1 | < 0.1% | |
| 1.00241 | 1 | < 0.1% | |
| 1.0024 | 1 | < 0.1% |
pH
Real number (ℝ≥0)
| Distinct | 103 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.195458218 |
|---|---|
| Minimum | 2.72 |
| Maximum | 3.82 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 30.9 KiB |
Quantile statistics
| Minimum | 2.72 |
|---|---|
| 5-th percentile | 2.96 |
| Q1 | 3.09 |
| median | 3.18 |
| Q3 | 3.29 |
| 95-th percentile | 3.46 |
| Maximum | 3.82 |
| Range | 1.1 |
| Interquartile range (IQR) | 0.2 |
Descriptive statistics
| Standard deviation | 0.1515455672 |
|---|---|
| Coefficient of variation (CV) | 0.0474253008 |
| Kurtosis | 0.5499570349 |
| Mean | 3.195458218 |
| Median Absolute Deviation (MAD) | 0.1 |
| Skewness | 0.455456831 |
| Sum | 12657.21 |
| Variance | 0.02296605892 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 3.16 | 128 | 3.2% | |
| 3.14 | 127 | 3.2% | |
| 3.22 | 119 | 3.0% | |
| 3.19 | 115 | 2.9% | |
| 3.15 | 114 | 2.9% | |
| 3.18 | 114 | 2.9% | |
| 3.24 | 114 | 2.9% | |
| 3.12 | 111 | 2.8% | |
| 3.2 | 111 | 2.8% | |
| 3.1 | 109 | 2.8% | |
| Other values (93) | 2799 | 70.7% |
| Value | Count | Frequency (%) | |
| 2.72 | 1 | < 0.1% | |
| 2.74 | 1 | < 0.1% | |
| 2.77 | 1 | < 0.1% | |
| 2.79 | 2 | 0.1% | |
| 2.8 | 3 | 0.1% |
| Value | Count | Frequency (%) | |
| 3.82 | 1 | < 0.1% | |
| 3.81 | 1 | < 0.1% | |
| 3.8 | 2 | 0.1% | |
| 3.79 | 1 | < 0.1% | |
| 3.77 | 2 | 0.1% |
sulphates
Real number (ℝ≥0)
| Distinct | 79 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4903509215 |
|---|---|
| Minimum | 0.22 |
| Maximum | 1.08 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 30.9 KiB |
Quantile statistics
| Minimum | 0.22 |
|---|---|
| 5-th percentile | 0.34 |
| Q1 | 0.41 |
| median | 0.48 |
| Q3 | 0.55 |
| 95-th percentile | 0.7 |
| Maximum | 1.08 |
| Range | 0.86 |
| Interquartile range (IQR) | 0.14 |
Descriptive statistics
| Standard deviation | 0.1135228053 |
|---|---|
| Coefficient of variation (CV) | 0.2315133924 |
| Kurtosis | 1.565020602 |
| Mean | 0.4903509215 |
| Median Absolute Deviation (MAD) | 0.07 |
| Skewness | 0.9378533357 |
| Sum | 1942.28 |
| Variance | 0.01288742733 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0.5 | 191 | 4.8% | |
| 0.46 | 182 | 4.6% | |
| 0.44 | 171 | 4.3% | |
| 0.38 | 165 | 4.2% | |
| 0.45 | 148 | 3.7% | |
| 0.47 | 144 | 3.6% | |
| 0.42 | 144 | 3.6% | |
| 0.48 | 142 | 3.6% | |
| 0.54 | 136 | 3.4% | |
| 0.4 | 134 | 3.4% | |
| Other values (69) | 2404 | 60.7% |
| Value | Count | Frequency (%) | |
| 0.22 | 1 | < 0.1% | |
| 0.23 | 1 | < 0.1% | |
| 0.25 | 4 | 0.1% | |
| 0.26 | 3 | 0.1% | |
| 0.27 | 10 | 0.3% |
| Value | Count | Frequency (%) | |
| 1.08 | 1 | < 0.1% | |
| 1.06 | 1 | < 0.1% | |
| 1.01 | 1 | < 0.1% | |
| 1 | 1 | < 0.1% | |
| 0.99 | 1 | < 0.1% |
alcohol
Real number (ℝ≥0)
| Distinct | 103 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.58935791 |
|---|---|
| Minimum | 8 |
| Maximum | 14.2 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 30.9 KiB |
Quantile statistics
| Minimum | 8 |
|---|---|
| 5-th percentile | 8.9 |
| Q1 | 9.5 |
| median | 10.4 |
| Q3 | 11.4 |
| 95-th percentile | 12.8 |
| Maximum | 14.2 |
| Range | 6.2 |
| Interquartile range (IQR) | 1.9 |
Descriptive statistics
| Standard deviation | 1.217076311 |
|---|---|
| Coefficient of variation (CV) | 0.1149339103 |
| Kurtosis | -0.695979718 |
| Mean | 10.58935791 |
| Median Absolute Deviation (MAD) | 0.9 |
| Skewness | 0.450696598 |
| Sum | 41944.44667 |
| Variance | 1.481274748 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 9.5 | 177 | 4.5% | |
| 9.4 | 169 | 4.3% | |
| 10 | 143 | 3.6% | |
| 9.2 | 140 | 3.5% | |
| 10.5 | 140 | 3.5% | |
| 11 | 130 | 3.3% | |
| 10.4 | 130 | 3.3% | |
| 10.8 | 119 | 3.0% | |
| 9 | 116 | 2.9% | |
| 10.2 | 116 | 2.9% | |
| Other values (93) | 2581 | 65.2% |
| Value | Count | Frequency (%) | |
| 8 | 2 | 0.1% | |
| 8.4 | 2 | 0.1% | |
| 8.5 | 9 | 0.2% | |
| 8.6 | 16 | 0.4% | |
| 8.7 | 46 | 1.2% |
| Value | Count | Frequency (%) | |
| 14.2 | 1 | < 0.1% | |
| 14.05 | 1 | < 0.1% | |
| 14 | 5 | 0.1% | |
| 13.9 | 3 | 0.1% | |
| 13.8 | 2 | 0.1% |
quality
Real number (ℝ≥0)
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.854834638 |
|---|---|
| Minimum | 3 |
| Maximum | 9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 30.9 KiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 5 |
| median | 6 |
| Q3 | 6 |
| 95-th percentile | 7 |
| Maximum | 9 |
| Range | 6 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.8906826795 |
|---|---|
| Coefficient of variation (CV) | 0.152127726 |
| Kurtosis | 0.2993451703 |
| Mean | 5.854834638 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.1120040345 |
| Sum | 23191 |
| Variance | 0.7933156355 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 6 | 1788 | 45.1% | |
| 5 | 1175 | 29.7% | |
| 7 | 689 | 17.4% | |
| 4 | 153 | 3.9% | |
| 8 | 131 | 3.3% | |
| 3 | 20 | 0.5% | |
| 9 | 5 | 0.1% |
| Value | Count | Frequency (%) | |
| 3 | 20 | 0.5% | |
| 4 | 153 | 3.9% | |
| 5 | 1175 | 29.7% | |
| 6 | 1788 | 45.1% | |
| 7 | 689 | 17.4% |
| Value | Count | Frequency (%) | |
| 9 | 5 | 0.1% | |
| 8 | 131 | 3.3% | |
| 7 | 689 | 17.4% | |
| 6 | 1788 | 45.1% | |
| 5 | 1175 | 29.7% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| df_index | fa | va | ca | rs | chlorides | fsd | tsd | density | pH | sulphates | alcohol | quality | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 7.0 | 0.27 | 0.36 | 20.70 | 0.045 | 45.0 | 170.0 | 1.0010 | 3.00 | 0.45 | 8.8 | 6 |
| 1 | 1 | 6.3 | 0.30 | 0.34 | 1.60 | 0.049 | 14.0 | 132.0 | 0.9940 | 3.30 | 0.49 | 9.5 | 6 |
| 2 | 2 | 8.1 | 0.28 | 0.40 | 6.90 | 0.050 | 30.0 | 97.0 | 0.9951 | 3.26 | 0.44 | 10.1 | 6 |
| 3 | 3 | 7.2 | 0.23 | 0.32 | 8.50 | 0.058 | 47.0 | 186.0 | 0.9956 | 3.19 | 0.40 | 9.9 | 6 |
| 4 | 6 | 6.2 | 0.32 | 0.16 | 7.00 | 0.045 | 30.0 | 136.0 | 0.9949 | 3.18 | 0.47 | 9.6 | 6 |
| 5 | 9 | 8.1 | 0.22 | 0.43 | 1.50 | 0.044 | 28.0 | 129.0 | 0.9938 | 3.22 | 0.45 | 11.0 | 6 |
| 6 | 10 | 8.1 | 0.27 | 0.41 | 1.45 | 0.033 | 11.0 | 63.0 | 0.9908 | 2.99 | 0.56 | 12.0 | 5 |
| 7 | 11 | 8.6 | 0.23 | 0.40 | 4.20 | 0.035 | 17.0 | 109.0 | 0.9947 | 3.14 | 0.53 | 9.7 | 5 |
| 8 | 12 | 7.9 | 0.18 | 0.37 | 1.20 | 0.040 | 16.0 | 75.0 | 0.9920 | 3.18 | 0.63 | 10.8 | 5 |
| 9 | 13 | 6.6 | 0.16 | 0.40 | 1.50 | 0.044 | 48.0 | 143.0 | 0.9912 | 3.54 | 0.52 | 12.4 | 7 |
Last rows
| df_index | fa | va | ca | rs | chlorides | fsd | tsd | density | pH | sulphates | alcohol | quality | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 3951 | 4888 | 6.8 | 0.220 | 0.36 | 1.20 | 0.052 | 38.0 | 127.0 | 0.99330 | 3.04 | 0.54 | 9.2 | 5 |
| 3952 | 4889 | 4.9 | 0.235 | 0.27 | 11.75 | 0.030 | 34.0 | 118.0 | 0.99540 | 3.07 | 0.50 | 9.4 | 6 |
| 3953 | 4890 | 6.1 | 0.340 | 0.29 | 2.20 | 0.036 | 25.0 | 100.0 | 0.98938 | 3.06 | 0.44 | 11.8 | 6 |
| 3954 | 4891 | 5.7 | 0.210 | 0.32 | 0.90 | 0.038 | 38.0 | 121.0 | 0.99074 | 3.24 | 0.46 | 10.6 | 6 |
| 3955 | 4892 | 6.5 | 0.230 | 0.38 | 1.30 | 0.032 | 29.0 | 112.0 | 0.99298 | 3.29 | 0.54 | 9.7 | 5 |
| 3956 | 4893 | 6.2 | 0.210 | 0.29 | 1.60 | 0.039 | 24.0 | 92.0 | 0.99114 | 3.27 | 0.50 | 11.2 | 6 |
| 3957 | 4894 | 6.6 | 0.320 | 0.36 | 8.00 | 0.047 | 57.0 | 168.0 | 0.99490 | 3.15 | 0.46 | 9.6 | 5 |
| 3958 | 4895 | 6.5 | 0.240 | 0.19 | 1.20 | 0.041 | 30.0 | 111.0 | 0.99254 | 2.99 | 0.46 | 9.4 | 6 |
| 3959 | 4896 | 5.5 | 0.290 | 0.30 | 1.10 | 0.022 | 20.0 | 110.0 | 0.98869 | 3.34 | 0.38 | 12.8 | 7 |
| 3960 | 4897 | 6.0 | 0.210 | 0.38 | 0.80 | 0.020 | 22.0 | 98.0 | 0.98941 | 3.26 | 0.32 | 11.8 | 6 |